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ABSTRACT:  The  starting  point  of  this  investigation  is  the .pro-  A 

- t  ?  '  \  _ 

perties  of  restricted  quadratic  top0b,  (y  Ax,  x £  S5&® ,  where  A 


is  an  ap<m  real  symmetric  matrix',  and  S  is  a  subspace.  The  index 


theory  of  Hestenes  (1951)  and  Maddocks  (1985)  that  treats  the  more 


general  Hilbert  space  version  of  this  problem  is  first  specialized 

to  the  flnlte7^1menslonal  context,  and  appropriate  extensions, 

y 

valid  only  in  f inite/di mens Ions ,  are  made.  The  theory  is  then 


applied  to  obtain  various  Inertia  theorems  for  matrices  and  posi¬ 


tivity  tests  for  quadratic  forms.  Expressions  for  the  Inertias  of 


divers  symmetrically  partitioned  matrices  are  described.  In  par¬ 


ticular,  an  inertia  theorem  for  the  generalized  Schur  complement 


is  given.  The  investigation  recovers,  links  and  extends  several, 


formerly  disparate,  results  in  the  general  area  of  inertia 
theorems .  -  Qf  |Q 
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§  1 .  Introduction 


The  main  subject  of  this  presentation  is  the  properties  of 
a  quadratic  form  defined  on  Ra  by  an  m*m  real  symmetric  matrix 
A  when  attention  is  restricted  to  a  given  subspace  Scf>®.  if 
the  subspace  S  is  actually  the  whole  of  SR®,  the  essential  pro¬ 
perties  of  the  quadratic  form  are  encapsulated  in  the  inertia  of 
the  matrix  A,  denoted  In  A,  namely  the  triple  comprising  the 
number  of  positive,  negative,  and  zero  eigenvalues.  It  will  here 

be  shown  that  this  concept  of  inertia  can  be  usefully  generalized 

« 

to  obtain  a  triple  In  (S;A),  depending  on  both  the  matrix  A  and 
the  subspace  S,  that  captures  the  properties  of  the  quadratic 
form  xT  Ax  restricted  to  S. 


It  is  then  shown  that  In  (S;A)  is  intimately  connected  to 
*  A  A 

In  A  and  In  (S  ;A),  where  S  is  the  subspace  that  is  A-ortho- 
gonal  to  S.  The  main  idea  is  that  the  properties  of  xrAx  on  S 
are  completely  determined  if  the  properties  on  PB  and  on  SA 
are  known.  This  result  is  then  applied  to  obtain  Inertia  theorems 
for  matrices  of  the  type  BTAB,  and  for  symmetrically  partitioned 
matrices.  Particular  emphasis  is  given  to  the  matrix  construction 
known  as  the  generalized  Schur  complement.  Tests  concerning  posi-/'"©^ 


I  COpy  I 

tivity  of  A  on  subspaces  are  also  described.  / 

ftSs'O' 

Some  of  the  results  stated  here  are  necessarily  complicated. 

» 

They  comprise  equalities  between  several  different  indices  or  "  "g 

dimensions,  and  it  seems  unlikely  that  many  of  these  Indices  will**  q 
be  readily  calculable  in  concrete  applications.  However,  all  of  -  — . 
the  results  obtained  here  Include  known  results  as  special  cases. 

_^UabllUy  0«hu7 


If'1, 


Special 
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Invariably,  these  prior  results  comprise  situations  in  which 
several  of  the  Indices  are  known.  Thus  the  theory  developed  here 
casts  light  on  the  necessity  of  hypotheses  and  assumptions  made  in 
previous  analyses.  Moreover,  a  considerable  unification  is 
achieved  by  the  construction  of  connections  between  previously 
unrelated  works. 

The  presentation  is  structured  as  follows.  In  §2,  the  index 
theory  of  Hestenes  (1951)  and  Maddocks  (1985),  which  was  derived 
in  the  context  of  the  isoperimetric  calculus  of  variations,  is 
specialized  to  finite-dimensions,  and  the  appropriate  extensions 
are  aade  in  order  to  obtain  inertia  theorems.  Then,  in  §3,  the 
theory  is  reformulated  in  terms  of  the  Moore -Penrose,  or  general¬ 
ized,  Inverse  of  A.  Connections  between  In  A  and  In  BT AB  are 
described  in  $4.  Of  course,  the  classic  result  known  as  Sylvester's 
Law  of  Inertia  is  recovered  as  a  particular  case.  Attention  is 
turned  to  partitioned  matrices  in  §§ 5  and  6.  Preliminary  results 
are  given  in  55,  and  the  generalized  Schur  complement  is  discussed 
in  16.  Theorems  of  Morse  (1971)  and  of  Han  (1986)  are  also  dis¬ 
cussed  here.  Finally,  in  $7,  the  particular  question  of  positivity 
on  a  subspace  is  treated. 

It  should  be  emphasized  that  the  theory  given  here  overlaps 
with,  and  builds  on  the  analysis  of  many  authors.  Because  there 
are  so  many  connections  with  prior  works,  I  do  not  attempt  detail¬ 
ed  attributions  here  in  the  introduction.  Full  discussion  and 
referencea  are  given  at  the  appropriate  junctures  throughout  the 
body  of  the  text.  The  interrelations  of  this  work  and  prior 
analyses  are  also  summarised  in  «8. 
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§ 2 .  General  Results  Involving  A-orthoaonal  Complements 

The  content  of  this  section  is  a  finite-dimensional  version 
of  the  theory  of  restricted  quadratic  forms  derived  in  Maddocks 
(1985,  §2),  which  in  turn  is  a  development  of  the  theory  of 
Hestenes  (1951).  Considerable  changes  in  notation  and  emphasis 
are  made  in  order  to  facilitate  applications  to,  and  comparison 
with  the  theory  of  symmetric  real  matrices.  Consideration  of  the 
finite-dimensional  case  also  allows  various  extensions  of  the 
theory.  The  scope  of  the  development  given  here  is  limited  to  the 
material  required  to  understand  the  statement  of  the  main  result, 
namely  Theorem  2.6.  In  particular  the  complete  proof  of  Theorem 
2.6  is  not  given.  The  steps  omitted  here  can  be  found  in  Maddocks 
(op.  clt.). 

Consider  a  real  symmetric  mvm  matrix  A.  He  shall  study  the 
properties  of  the  quadratic  form  Q(x)  »  xT Ax  for  x  e  S  a  sub¬ 
space  of  P  .  The  main  focus  of  our  attention  is  the  connection 
between  the  properties  of  Q  restricted  to  S  and  of  Q 
restricted  to  other  related  subspaces. 

It  is  apparent  that  the  properties  of  Q  on  S  are  inti¬ 
mately  connected  to  the  inertia,  In(M)  *  (n(M),  *>(M),  6(M)),  l.e. 
the  triple  comprising  the  number  of  positive,  negative,  and  zero 
eigenvalues,  of  the  symmetric  matrix  M  *  BTAB.  Here  B  is  any 
m*m  matrix  whose  range  la  S.  Accordingly,  the  results  presented 
here  can  later  be  couched  in  terms  of  inertia  theorems  for  certain 
matrices . 
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The  following  notion  of  A-orthogonality  will  appear  through¬ 


out. 


Definition  2.1:  Two  vectors  Xj ,  x2  e  P®  are  termed  A-orthogonal 
If 

XT^AX2  «  0. 


Remarks . 

(a)  As  A  is  symmetric,  the  relation  is  symmetric. 

(b)  Any  vector  is  A-orthogonal  to  any  element  of  the  kernel 
or  nullity  of  A.  which  is  here  denoted  N(A). 

(c)  If  either  of  the  vectors  is  an  eigenvector  of  A  not  in 
N( A) ,  then  orthogonality  and  A-orthogonal ity  are  equi¬ 
valent  . 

(d)  Whenever  A  is  neither  positive  nor  negative  semi- 
definite,  there  exist  vectors  x  such  that  Ax  *  0, 
but  xtAx  *  0. 

(e)  The  concept  of  A-orthogonal ity  extends  to  subspaces  in 
the  obvious  way.  The  A-orthogonal  complement  of  a 
subspace  S  will  be  denoted  S*.  It  should  be  noted 
that  for  any  subspace  S,  H(A)cS*. 

(f)  Because  of  remark  (d)  above  it  is  possible  that 
SnSKnR{A)  *  (0). 

(g)  A  useful  characterization  is  provided  by 

S*  »  {y:  Ay  e  SA)  ■  (y:  xT  Ay  ■  0,  Vx  e  S). 


This  property  of 

•s 


can  be  used  to  derive  simple  results,  such 


-  S  +  11(A), 
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and 

(SAnTA)A  -  S  +  T  +  N(A) . 

Lemma  2.1.  Any  subspace  of  P®  has  a  mutually  A-orthogonal  basis. 

Proof .  Consider  any  basis  w^,  1  *  l,...,p.  Then,  as  A  is 
symmetric,  the  matrix  W  *  {w^} 

wij  “  W1A  wj' 

is  symmetric.  Therefore  there  exists  an  orthogonal  n<n  matrix 
R  *  (r^)  such  that  PT  AP  is  diagonal.  The  set 

{v  ui  ■  l  rW 

is  a  basis  because  P  is  nonsingular,  and  by  construction  it  is 
also  mutually  A-orthogonal . 

Remark.  Because  A  is  not  necessarily  definite,  the  obvious 
generalization  of  the  Gram-Schmldt  procedure  does  not  work.  In 
particular  it  is  not  in  general  possible  to  construct  an 
A-orthonormal  basis. 

Lemma  2.2.  Let  B  be  an  a«n  matrix  of  rank  p.  Let  (w^) 

1  *  l,...,p,  c  p",  be  any  basis  for  S  -  R(B) ,  the  range  of 

B.  Define  the  p*p  matrix  W  by  W  ■  (w^)  •  (W^AWj).  Then 

XnBTAB  ■  XnW  +  (o,o,n-p).  (2.1) 

In  particular,  if  («t^)  is  a  mutually  A-orthogonal  basis,  then 
the  Inertia  of  the  n»n  symmetric  matrix  BTAB  is  determined  by 
the  inertia  of  the  p*p  diagonal  matrix  Dlag  (w^Aw^). 
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Proof »  The  Poincare  variational  characterization  of  the  eigen¬ 


values  of  a  self-adjoint  operator  (e.g.  Weinberger,  1974)  implies 
that  MM),  the  number  of  negative  eigenvalues  of  a  symmetric 
matrix  M,  coincides  with  the  largest  dimension  k  such  that 
xtKx  Is  negative  for  all  x  in  a  subspace  of  dimension  k.  This 
result  can  be  applied  both  to  the  n*n  matrix  BT AB  and  to  the  p*p 
matrix  W  «  { vTj  A  v  i } .  But  any  k  dimensional  subspace  of  Pn  on 
which  Bt  AB  is  negative,  provides  a  k  dimensional  subspace  of 
on  which  W  is  negative  and  vice  versa,  via  the  following 
construction.  Suppose  (x^)  i  *  l,...,k,  e  pn  is  a  basis  of 
a  negative  subspace  of  BTAB.  Then  Bx^  *  0,  and  as  (v^)  is  a 
basis  of  R(B) ,  3  an  m*p  matrix  K  *  (kjj)  of  rank  k  such 

that  Bx^  »  £  kijVj*  ®y  construction,  the  rows  of  K  span  a 
negative  subspace  of  dimension  k  in  Pp.  Thus  MB*  AB)  ■  MW). 

In  a  similar  fashion  it  can  be  shown  that  n( BT AB)  -  «(W). 

The  last  equality  contained  in  (2.1)  is  then  easily  obtained  once 
it  is  remarked  that  for  any  q*q  matrix  M 

*(»)  ♦  Mm)  +  6  (m)  -  q.  (2.2) 

Remark .  When  the  properties  of  the  quadratic  form  xTAx,  x  c  R(B) 
are  under  consideration,  it  is  natural  to  consider  the  eigenvalue 
problem 

BTABy  ■  XBTBy,  By  0  0. 

It  is  eigenvalues  in  this  sense  that  determine  whether  xT  Ax  is 
positive  definite  on  R(B).  Thus,  zero  eigenvalues  of  BT AB  that 
have  an  eigenvector  y  c  N(B)  are  not  of  concern.  The  modified 
eigenvalue  problem  leads  to  a  modified  inertia 
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In*(BTAB)  «  In(BTAB)  -  (0,0,  dim  N(B) )  . 

Accordingly,  equation  (2.1)  appearing  In  Lemma  2.2  can  be  rewritten 
in  the  form 

In*(BTAB)  «  InW.  (2.3) 

Although  we  shall  not  pause  to  prove  it,  the  modified  inertia  also 
measures  the  number  of  positive,  negative  and  zero  eigenvalues  A 
of  the  problem 

X  e  R(B),  Ax  -  Axe  N  { B*  )  . 

As  an  immediate  Corollary  to  Lemma  2.2,  we  have 

Corollary  2.3.  If  B ^  is  m^n,  B 2  is  m*/  and  RfBj)  *  R(B2) 
then 

In  B^ABj  +  (0,0,  t  -  n)  •  In  BjA B2 

Remark.  If  the  matrices  Bj  and  B2  are  square  and  of  full 
rank,  l.e.  m  ■  n  »  /  «  p.  Corollary  (2.3)  reduces  to  the 
celebrated  result  known  as  Sylvester's  Law  of  Inertia. 

Because  of  Lemma  2.2  it  is  practical  to  associate  an  Inertia 
with  the  subspace  S. 

Definition  2.2.  The  inertia  of  a  subspace  S,  denoted  In  (S;A) 
or  In  (S),  is  defined  by 

In* (S;A)  -  In  W, 

where  W  is  the  matrix  defined  in  Lemma  2.2.  By  the  remark 
following  Lemma  2.2  and  equation  (2.3) 

In*(S;A)  •  In*(BTAB) 
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where  B  Is  any  m*n  matrix  whose  range  is  S.  Obviously  the 
quadratic  form  Q(x)  *  xT Ax  satisfies  xT Ax  2  0,  Vx  e  S  if  and 
only  if  v  (S; A)  ■  0,  with  analogous  result  holding  for  posi¬ 
tivity,  nonpositivity  etc. 

Definition  2.3.  For  a  subspace  ScR®  the  relative  nullity  of  A 
on  S,  denoted  d'(S;A)  or  just  d°(S),  is  defined  by 

d’(S;A)  «  dim  (SnSAnR(A)).  (2.4) 

Remark .  In  Maddocks  (1985)  the  relative  nullity  is  given  a 
different  definition  and  (2.4)  has  the  status  of  a  theorem.  In 
the  work  of  Hestenes  as  is  reported  in  Gregory  (1980,  §2),  the 
relative  nullity  is  given  yet  another  equivalent  definition. 

Lemma  2.4.  For  any  m^m  matrix  A,  and  subspace  ScR®  the  relative 
nullity  d°(S;A)  and  6*(S;A),  (defined  in  Definition  2.2)  are 
related  by 

6* (S;A)  -  dim(SnSA)  -  d*(S;A)  ♦  dim(N(A)nS)  (2.5) 

Remark .  The  quantity  dim(SoSA)  is  sometimes  called  the  nullity 
of  A  on  S . 

Proof .  The  second  equality  in  (2.5)  follows  immediately  from 
Definition  2.3  and  the  facts  that  R( A)  -  (M(A) )x  and  N(A)cSA. 

To  prove  the  first  equality  consider  an  A-orthogonal  basis 
{Sj},  1  *  l,...,p  of  S,  the  existence  of  which  is  guaranteed  by 
Lemma  2.1.  Then,  by  definition, 

6*(S;A)  -  6(W), 
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where  6 (W)  is  the  number  of  basis  vectors  s^  satisfying 


8iAsl 


0. 


Contrariwise,  any  x  e  SoSA  can  be  written  in  terms  of  the 
basis  {Sj)  of  S  as 

x  -  l 
1 

but  for  x  to  be  in  SA 


ai<8i'A8i>  *  °* 

T  A 

Thus,  those  s^  satisfying  s^As^  *  0  comprise  a  basis  of  SoS  , 
and  consequently, 

6(W)  -  dim(SoSA) . 


Lemma  2.5.  Let  S  and  Y  be  subspaces  of  P*  with  S^RtA)  c  Y^RCA 
and  YnY^RfA)  -  {0}.  Then 

d* (S; A)  -  d° ( SAnYA ; A) . 


Proof .  By  definition 

d* (S;A)  -  dim{S"SA"R(A)) 

and 

d* (SAoYA ; A )  -  dim{SAnYAn(SAnYA)ArR(A) ) . 

It  was  earlier  remarked  that 

(SAnYA)A  •  S  ♦  Y  ♦  H(A) , 

and  by  hypothesis  SnR(A)  c  YAnR(A) ,  and  YnYAnR(A)  ■  (0). 
Consequently, 

d*  (8AnYA ; A)  -  dla{  (S  ♦  Y)osAnyAoR{ A) )  -  dim{SnSAnYAnR( A) ) 

-  dla(SoSAnR( A) )  >  d* (S; A) . 
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Remark.  The  hypotheses  on  the  subspace  Y  appear  unduly 
restrictive,  but  they  encompass  the  important  case  Y  -  {0},  in 
which  we  obtain  the  result 

d#(S;A)  =  d° ( SA; A) . 


Theorem  2.6.  Let  A  be  an  m*m  real  symmetric  matrix,  and  let  S 

and  Y  be  subspaces  of  IRm,  with  SnR(A)  c  YAnR{A) ,  and 
YoY^RtA)  -  {0}.  Then 

In* ( YA ; A)  «  In*(S;A)  +  In* ( SAn YA ; A) 

+  (d#  (s) ,  d#(s),  -  d°(s)  -  dim{Sr>SA) ) .  (2.6) 

A  *  o 

The  quantities  S  ,  In  and  d  were  introduced  in  Definitions 
2.1,  2.2  and  2 . 3 


Proof .  Identity  (2.6)  assets  three  equalities.  The  equality 

p*(YA;A)  -  **(S;A)  ♦  P*(SAnYA;A)  +  d*(s)  (2.7) 


is  a  restatement  of  Theorem  2  of  Naddocks  (1985)  applied  to  the 
operator  L  ■  PAP,  where  P  is  orthogonal  projection  onto  the 
subspace  YA.  The  equivalence  is  apparent  once  it  is  remarked 
that  the  number  of  negative  eigenvalues  of  PAP,  denoted  o~(PAP), 
is  just  v*  ( yA ; A) ,  that  the  index  d“(S)  is  equivalent  to  t^*(S), 


that 


and  that 


SPAP  -  (SAnYA)  u  (YA)x, 
SnSA  ■  SnSAnYA  ■  S"SPAP. 


(2.8) 

(2.9) 


Similarly,  consideration  of  the  operator  L  *  -PAP  provides 
the  equation 
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n'fY^A)  =  «* (S ; A)  +  «*(SAnYA;A)  +  d° (s) . 

The  final  equation  can  be  obtained  as  follows. 

Three  applications  of  Lemma  2.4,  yields 

6*(YA;A)  =  d°(YA;S)  +  dim{K(A)nYA) 

6  * ( S ; A)  »  dim{SnSA) 

and 

6*(SAnYA;A)  =  d°(SAnYA;A)  +  dim{N( A)oSAnYA) . 

Now,  the  A-orthogonal  complement  of  any  subspace  contains 
and  by  hypothesis 


(2.10) 


(2.11) 

(2.12) 

(2.13) 
N{  A)  , 


dim(YnYA nR{ A) )  =  d°(YA;S)  *  0. 
Accordingly  (2.11)  and  (2.13)  reduce  to 
6*(YA;A)  -  dim  N(A) , 

6*(YA;A)  «  6  * ( SAnYA ; A)  -  d°(SAnYA;A). 


and 


Lemma  (2.5)  and  equation  (2.12)  then  imply  the  equation 

6 * ( YA ; A )  «  6 * ( S ; A)  ♦  6*(SAnYA;A)  -  d°(S;A)  -  dim(SoSA) 


as  required. 


Remark.  Theorem  2.6  will  be  applied  in  §6,  and  the  full  result 

stated  above  will  be  required.  Nevertheless,  for  most 
applications  the  case  Y  ■  (0)  suffices.  The  special  case  is 
sufficiently  ublquituous  in  the  sequel  that  it  is  formalized  as: 

Corollary  2.7.  Let  A  be  an  rn  real  symmetric  matrix,  and  let 
S  be  a  subspace  of  R*.  Then 

In  A  -  In* (S; A)  +  In*(SA;A)  +  (d* (s) ,d* (s) ,  -  d°(s)  -  dim(SnSA)). 
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Proof .  The  subspace  Y  *  (0)  satisfies  the  hypotheses  of  Theorem 
2.6,  for  then  YA  *  Rm.  Moreover,  the  Inertia  of  the  subspace  Rm 
coincides  with  the  inertia  of  the  matrix  A. 


Remark .  Han  &  Fujiwara  (1985,  Theorem  2.3),  obtained  Corollary 
2.7  in  the  further  special  case  d° (s)  *  0.  Their  result  includes 
the  additional  hypothesis 


so  that 


SnSA  c  N( A) 

d°(s)  =  dim{SnSApR(A) )  *  0. 


Han  &  Fujiwara 's  development  was  independent  of  the  works  of  Hes- 
tenes  (1951)  and  Maddocks  (op.  cit.),  and  they  actually  adopt  a 
different  definition  for  the  quantity  that  is  here  denoted 
In* (SA;A) . 


SA  *  span(s,0),  R(A)  ■  R2,  d°(S;A)  *  1  «  dim(Sr'SA), 
(1,0)tA(1,0)  *  (1,0)T(0,1)  «  0,  and  In*(S;A)  «  In*(SA;A)  « 
(0,0,1)  . 


Consequently,  Corollary  2.7  predicts  that 

InA  «  (0,0,1)  +  (0,0,1)  +  ( 1 , 1 , -2 )  «  (1,1,0), 
and  of  course  the  eigenvalues  of  A  are  ±1. 


Corollary  2.8  can  be  applied  to  clarify  the  geometrical  role 
played  by  d*(S;A).  Because  the  sum  of  the  components  of  an 
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Inertia  equal  the  dimension  of  the  space,  we  find  from  Corollary 
2.8  that 

m  «  dim  S  +  dim  S*  +  d°(s)  -  dim{SnSA). 

Mt  fc 

Consequently,  R  can  be  decomposed  as  the  sum  of  S  and  S 
precisely  if  d°(s)  *  0.  The  decomposition  is  a  direct  sum  if  the 

A 

stronger  condition  dim{SnS  }  «  0  is  satisfied. 

§3.  Results  Utilizing  the  Generalized  Inverse  of  A 

The  results  of  §  2  are  direct  in  the  sense  that  they  do  not 
Involve  any  inversion  of  the  matrix  A.  It  will  here  be  shown 
that  when  the  Moore-Penrose,  or  generalized.  Inverse  of  A  is 
introduced.  Corollary  2.7  can  be  reformulated  to  emphasize  the 
symmetry,  or  duality,  between  the  subspace  S  and  the  subspace 
S"L ,  the  usual  orthogonal  complement  of  S. 

Definition  3.1.  For  an  m*n  matrix  C,  the  generalized,  or 
Moore-Penrose  ,  Inverse  is  the  unique  nvm  matrix  C+  satisfying  the 
four  conditions: 


(1) 

CC+C  ■ 

C 

(ii) 

C+CC+ 

-  c+ 

(ill) 

(CcV 

»  cc+ 

(iv) 

<C*C)T 

«  c+c. 

Geometrically,  C*  can  be  regarded  as  the  Inverse  of  the 
Invertible  operator  that  is  obtained  when  C  is  restricted  to 
domain  R(CT),  and  range  R(C).  It  should  be  remarked  that 

N(C+)  -  N(CT  )  and  R(C+)  -  R(CT). 
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As  a  siaple  consequence  of  the  definition  we  have  that 


(CT  )+  -  {C+)T  . 

When  A  is  symmetric,  A+  is  also  symmetric,  and  conditions 
(Hi)  and  (iv)  both  reduce  to 

A+A  *  AA+. 

Moreover,  direct  consideration  of  the  standard  eigenvalue  problem 
demonstrates  that 

In  A  *  In  A*  (3.1) 


Theorem  3.1.  Let  A  be  an  m*m  real  symmetric  matrix,  and  let  S 
be  a  subspace  of  Rm.  Then 

InA  *  In*  (S;A)  +  In*(SA;A+)  +  (d,d,  -  2d),  (3.2) 


where 


d  «  dim(SnA+SA)  «  dim(ASnSA). 


The  triple 


In 


was  introduced  in  Definition  2.2. 


(3.3) 


Proof.  Consider  Corollary  2.7  and  notice  that 

In*(P®;A)  »  InA 


and  that 


SA  «  N( A)  ®  A+Sa, 


(3.4) 


the  sum  being  direct.  Then  (2.6)  can  be  written  as 

InA  -  In*(S;A)  ♦  In* (N(A)  +  A+SX;A) 

♦  (d,d,  -d  -  dlm(Sn (N( A)  +  A*(Sa))))  (3.5) 

where  d  is  given  by  (3.3).  Because  the  subspaces  N(A)  and 
A+Sx  are  mutually  orthogonal  and  mutually  A-orthogonal , 


(3.6) 


In* (N(A)  +  A+Sa;A)  -  In*  (A^S1  ;A)  +  (0,0,  dim  N(A) ) 

Moreover,  by  Definition  2.2  of  In*,  end  property  (il)  of 
the  definition  for  A+, 

In* (  A+Sa ; A)  «  In* ( SA ; A+)  -  (0,0,  dim(N(A)nSA ) ) .  (3.7) 

Now 

din (So (N(A)  ♦  A+Sa)>  «  din(SnN(A) )  +  dim(SnA+SA), 

and 

dim(N(A)oSx }  +  dim(N(A)nS)  •  din  N(A) . 

Consequently,  substitution  of  (3.6)  and  (3.7)  in  (3.5)  yields  (3.2) 
as  required,  where  d  is  defined  by  the  first  equality  in  (3.3). 
The  second  equality  follows  because  SnA+SA  c  R(A) ,  and  when 
restricted  to  the  range  of  A,  AA+  acts  as  the  identity.  That  is 

dim(SnAV-)  «  din(ASoAA+SA)  *  din(ASnSA). 

Han  &  Fujlwara  (1985,  Corollary  2.5),  obtained  the 
special  case  of  Theoren  3.1  that  arises  when  the  addi¬ 
tional  two  conditions  of  A  being  invertible  and  d 
vanishing,  are  inposed  as  hypotheses. 

(b)  The  roles  of  S  and  SA ,  and  A  and  A+  can  be 

permuted  in  the  proof  of  Theorem  (3.1)  to  yield  three 
other  analogous  equations.  However,  in  light  of  (3*1) 
and  (3.3)  no  new  information  is  obtained. 


•  Connections  between  In  A  and  In  BT  AB 

As  before  A  denotes  a  real  a*m  symmetric  matrix,  and  B 
denotes  a  real  m*n  matrix.  Recall  that  In(BY  AB)  Is  the  triple 
comprising  the  number  of  positive,  negative  and  zero  eigenvalues  of 
the  standard  eigenvalue  problem 

Br  ABx  *  AIx,  x  e  Rn, 

whereas  In*(BTAB)  is  the  triple  comprising  the  number  of 
positive,  negative  and  zero  eigenvalues  of  the  eigenvalue  problem 

Bt  ABy  -  A Bt  By ,  y  e  Rn,  By  »»  0. 

According  to  the  theory  of  4  2,  with  B  m*n, 

In(BTAB)  -  In*(BTAB)  ♦  (0,0,  n-p) ,  (4.1) 

where  p  is  the  rank  of  B,  i.e.  p  -  dimR(B). 

Furthermore,  if  S  *  R(B) ,  the  inertia  In*(S;A)  associated  with 
the  subspace  S  satisfies 

In* (S; A)  «  In*(BTAB). 

Consequently,  Theorem  (2.5)  and  (3.1)  can  be  applied  to  obtain 
inertia  theorems  for  certain  matrices. 

In  effect  it  only  remains  to  obtain  the  most  explicit 
expressions  for  quantities  such  as  S*  and  SnS*. 

Suppose  that  S  ■  R(B),  then 

8A  -  (y:  yT  Ax  •  0,  Vx  «  R(B) }  -  (y:  yT  ABz  -  0,  Yz  c  Rn). 

That  is  R(B)^  ■  (R(AB))X  ■  M(BTA).  It  is  also  convenient  to  note 
that 
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(4.2) 


N(BtA)  -  A+H(Bt  )  •  H(A) , 

the  toi  being  direct.  Here  A +  is  the  generalized  inverse 
defined  in  §3.  Equation  (4.2)  is  merely  a  restatement  of  (3.4), 
because  N(BT )  ■  R(B)X.  Consequent  upon  (4.2) 

dim(R(B)nR(B)A)  -  dim(R(B)n (A+W(BT  )  •  N( A) ) ) 

»nd  .  +  , 

d  (R(B) )  -  dim(R(B)nA  «(B  )). 

Application  of  Corollary  2.7  then  provides 

Corollary  4.1.  Let  A  be  an  m«m  real  symmetric  matrix,  let  B 
be  an  m*n  real  matrix,  and  let  C  be  any  m*q  matrix  whose  range 


is  N(BtA) 

Then 

In  A  -  In*BTAB  ♦  In*CT  AC  ♦  (d,d,  -  d  -  e) 

(4.3) 

where 

d  «  dim{R(B)oA+N(BT )) 

(4.4) 

and 

e  -  dim(R(B)oN(BTA)). 

(4.5) 

Remarks . 

(•) 

A  result  in  terms  of  In  BT  AB  and  In  CT  AC 

is  easily 

(b) 

obtained  by  exploitation  of  (4.1). 

Notice  that  d  ■  e  precisely  if  R(B)nN(A) 

-  (Q).  In 

particular  if  A  is  invertible,  d  »  e. 

(c)  One  possible  choice  for  C  is  the  m*m  matrix 
(X.  -  (BtA)*(BtA)). 


When  B  has  rank  a.  Corollary  (4.1)  can  be  simplified.  For 
then,  N(Bt  )  -  (fi)  and  R(B)  -  R*.  Accordingly,  d  -  0,  e  -  6(A), 
and  In*CT  AC  -  (0,0,  6(A)). 


IB 


Thus  we  have 

Corollary  4.2.  Let  A  be  an  ■'(a  real  symmetric  matrix,  and  B 
be  an  a*n,  n  t  m,  real  matrix  of  rank  m.  Then 

In  A  «  In  l;  AB  +  (0,0,  a-n) . 

Remark.  The  case  m-n  is  Sylvester's  Law  of  Inertia,  and  the 
case  n  >  a  is  merely  a  trivial  extension  that  could  be  obtained 
directly.  The  aore  complicated  result  embodied  in  Corollary  4.1 
indicates  that  there  can  be  no  direct  and  simple  extension  of  Syl¬ 
vester's  Law  of  Inertia  to  the  case  where  B  is  not  of  rank  m. 

In  a  similar  way  Theorem  3.1  can  be  translated  into  purely 
matrix  fora. 

Corollary  4.3.  Let  A  be  an  a«a  symmetric  matrix,  let  B  be  any 
a*n  matrix,  and  let  C  be  any  m*q  matrix  whose  range  is  N(Br  ). 
Then 

In  A  »  In*BTAB  ♦  InV  A+C  ♦  (d,d,  -2d), 

where 

d  -  dia<R(B)nA+N(BT  ))  -  dia(AR(B)nM(BT  )). 

One  possible  choice  for  C  is  the  m«m  matrix 
(I  -  (Bt  )V  )  -  I  -  BB+, . 

As  before,  a  result  in  terms  of  InBTAB,  InCTA+C,  and 
the  ranks  of  B  and  C,  is  easily  obtained  via  (4.1). 
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He  shall  hers  apply  the  theory  of  §13  and  4  to  obtain  results 


relating  the  inertias  of  certain  partitioned  Matrices  to  the 
inertias  of  subsatrlces. 


Leans  5.1.  Let  A  be  an  a«i  syanetrlc  matrix,  and  let  L  denote 
the  2a>2a  syanetrlc  matrix 


Then 


In  L  -  (a,m,  0) . 


Moreover , 


(5.1) 


(5.2) 


Proof .  Denote  the  eigenvalues  of  A  by  ,  i  ■  1 , . . .  ,a.  and 
denote  the  corresponding  elgenvsctors  by  u^.  Then  L  has  a 
positive  eigenvalues  X^  given  by  the  formula 

+  *  * 

> i  •  — . 


with  corresponding  elgenvsctors  (X^  y.y).1  Similarly,  L  has  m 
negative  eigenvalues 


with  eigenvectors  (X^  y,y). 


*Here  and  throughout  we  adopt  the  convention  that  when  x  and  y 
are  column  vectors,  (x,y)  is  also  a  column  vector. 


Remark.  Equation  (5.1)  is  widely  known,  see  for  example 
Chabrillac  &  Crouzeix  (1984),  Cottle  (1974)  and  references 
therein.  The  explicit  proof  by  exhibition  of  the  eigenvalues  and 
eigenfunctions  that  is  given  here  will  be  exploited  later. 


s.a.  Let  A  be  an  m*m  symmetric  matrix  and  let  B  be  an 
m*n  matrix  of  rank  p.  Then 


(m-d,  m-d,  n-p  +  2d)  -  In*(K(BT);  -A), 


(5.3) 


where 


d  -  di»(N(A)^N(BT  ) )  +  dim(AN(BT  )0R(B)) 

«  dim  (N(Bt  )oH(Bt  )A)  -  5*(N(BT  )  ;A) 


(5.4) 


Proof .  Apply  Theorem  3.1  with  the  matrix  L  of  Lemma  5.1  playing 
the  role  of  A  and  the  eubspace  S  being  (F*,R(B)).  Then 

InL  -  In*  ♦  Xn*((Q||,ll(BT  ));  L_1)  ♦  (d,d,  -2d)  (5.5) 

where  _  _ 

d  «  dim((ff>  ®,R(B))nL  1(0,N(BT  ))). 


But  Lemma  5.1  provides  L_1  explicitly,  and  consequently,  for 
y  cN(Bt  ) ,  l”1  (0,y)  s  (f?",R(B))  whenever  yc  N(  A)  or  Aye  R(B) . 
Consequently,  d  is  given  by  the  first  equality  in  (5.4).  The 
eecond  equality  follows  from  (3.4),  and  the  third  equality  is  then 
given  by  (2.5) . 

Furthermore, 

In*((Cm'H^B,  )>;  L"1}  ’  ’A>' 


from  (4.1) 


(A  Bl  ,ri_  0  1 

[A  I]  1 

F1. 

°1  TA  B1 

[b'  oj  "  In  [o  btJ 

k  cJI 

m 

0 

Bj  *  In[BT  Oj  "  (0*°*  n”p)* 

from  Lemma  5.1 

InL  ■ 

(m. 

m,0) . 

Substitution  Into  (S.5)  then  yields  (5.3). 


Remark .  Chabrlllac  &  Crouzelx  (1984)  obtained  a  result  related  to 
Lemma  5.2.  Detailed  discussion  of  the  connection  is  deferred  to 
$7.  Han  6  Fujiwara  (1985,  Theorem  3.4)  obtained  the  special  case 
that  arises  when  the  additional  hypothesis  d  «  0  is  imposed. 

Then 

In*  (N(Bt  ) ; -A)  -  (*>*(N(BT  );A),  n*  (N(BT  )  ;A)  .0) . 


and  because  p*(N(Bt);A)  +  «r*(M(BT);A)  *  *-p, 
be  rewritten 

fA  B1  , 

InLr  J  -  (p,p.n-p)  ♦  In  (N(B  );A)  » 


'0  1  O' 

'  0  * 

Example  5.3.  Take  A  ■ 

10  1 

0  1  0 

,  B  * 

1 

.  0  _ 

equation  (5.3)  can 


lp+n  ,p-w>*,n-p) , 


Then  a  ■  3,  n  ■  1,  and  p  ■  1.  Moreover,  H(BT )  ■  (a,  0,0), 
R(B)  «  (0,r,0)  a,0,r  «  R  and  AH(BT  )  •  (0,a+S,0).  Further¬ 
more,  because  N(A)  ■  (a,0,-a),  a  c  R,  d  ■  2.  It  is  also 
apparent  that  In*(N(BT);A)  ■  (0,0,2),  because 
(a.O.B)’ A(a,0,5)  -  0.  Thus  Lemma  5.2  can  be  applied  to  find 
that 
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0  10  0 

In  1  0  1  1  «  (3-2,  3-2,  1-1+4)  -  (0,0,2)  «  (1,1,2), 

0  10  0 
0  10  0 

which  result  can  be  verified  directly. 

Leama  5.2  is  somewhat  unsatisfactory  in  the  sense  that  the 

®  T 

quantity  In  (N(B  );-A)  is  in  general  undetermined.  Accordingly, 
special  cases  are  of  interest. 


Corollary  5.4.  Add  the  condition  p  ■  a  to  the  hypotheses  of 
Theorem  5.2.  Then 

In[BT  0]  *  {5*6) 

Proof .  When  p  *  m,  N(BT  )  -  (0). 

The  case  n  -  a  is  well  known.  The  case  n  >  a  can  be 
obtained  directly,  without  recourse  to  Lemma  (5.2).  See,  for 
example,  Cottle  (1974)  and  references  therein.  A  less  trivial 
special  case  is 

Corollary  5.5.  Add  the  condition  N(BT  )cN(A)  to  the  hypotheses 
of  Theorem  5.2.  Then 

[A  Bl 

InlB»  0J  ■  (p.p,n  ♦  a  -  2p). 

Proof.  When  K(BT)cN(A), 

In*(W(BT  )  ;-A)  -  (0,0,  dim  N(BT)). 

and  d  *  dim  B(BT  )  *  a  -  p. 


16.  Partitioned  Matrices  and  the  Generalized  Schur  Complement 


In  this  section  M  denotes  an  real  symmetric  matrix  that 
Is  partitioned  in  the  fora 

M-[*,  “].  <••*> 

where  A  and  C  are  n«m  and  n*n  real  symmetric  matrices,  and  B 
Is  an  m^n  matrix  of  rank  p.  Following  Carlson  et  al  (1974),  we 
have 

Definition  6.1.  The  generalized  Schur  complement  of  A  In  M, 
denoted  M/A,  Is 

M/A  -  C  -  BtA*B,  (6.2) 

where,  as  before,  A+  denotes  the  Moore-Penrose  inverse  of  A. 
Whenever  A  is  Invertible  A+  «  A-1,  and  (6.2)  reduces  to  the 
standard  definition  of  the  Schur  complement  of  a  symmetrically 
partitioned  matrix. 

In  this  section  the  preceding  development  will  be  applied  to 
obtain  certain  equalities  Involving  the  inertias  of  M,  A  and  M/A. 
These  equalities  subsume  Inequalities  obtained  by  Carlson  et  al 
(op.  clt.),  as  well  as  the  classic  result  that  applies  when  A  is 
nonsingular  (vide  infra).  Theorems  due  to  Morse  (1971)  and  to  Han 
(1986)  will  also  be  discussed. 

We  first  apply  Corollary  2.7,  with  S  «  (R(A),On).  Here,  as 
before,  (x,y)  denotes  a  column  vector.  Thus 

SM  «  ( (x,y) :  Ax  +  By  c  N( A) ) 
or 

SM  -  (~A*B  Fn,Fn)  S  (N(A),2n), 
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and  SnS  *  (Q) ,  so  d  (s)  *  0.  Consequently, 


InM  =  In*(R(A) ;N)  +  In*(SM;M). 


(6.3) 


It  is  apparent  from  Definition  2.2,  with  the  underlying  choice  of 


basis  taken  from  eigenvectors  of  A,  that 


In  <R(A);M)  -  (n(A) ,  *(A),  0) . 


(6.4) 


Theorem  (2.6)  is  next  applied  to  the  subspace 


Q  «  (-AtBR(M/A),  R(M/A)) 


M  MM 

regarded  as  a  subspace  of  S  .  Consequently,  the  subspace  S  oQ 


must  be  determined.  The  calculation  is  as  follows.  An  element  of 


SMoQM  Is  of  the  form 


(X  -  A+Bz.Z),  X  €  N( A) ,  Z  e  Rn, 


where 


(x  -  A+Bz,z)t  M(-A+By,  y)  *  0,  Vyc  R(M/A) . 


(6.5) 


Block  multiplication  and  the  properties  of  A  demonstrate,  that 


(6.5)  is  equivalent  to 


y  (B  x  +  (M/A)z)  «  0,  Vy  c  R(M/A) . 


Thus 


BT  X  +  (M/A)z  e  N(M/A)  , 


Z  +  (M/A)  B  X  €  N(M/A) . 


Consequently, 


SMnQM  -  ( ( x-A+Bz , z ) :  x  e  N( A)  and  Z  ♦  (M/A)+BTx  e  N(M/A) )  (6.6) 


.M^M 


Moreover,  SoS  n Q  *{£},  so  d(Q)a0.  Theorem  2.5  therefore 


allows  (6.3)  to  be  written  in  the  fora 


(6.7) 


InM  *  In*  (R(  A)  ;M)  +  In*  (Q ;M)  +  In*(SMnQM;M) 


It  has  already  been  noted  in  (6.4)  that  In  (R(A);M)  has  a 
particularly  simple  form.  It  is  next  shown  that 


In  (Q;M)  *  (»(M/A),  P(M/A),0). 


(6.8) 


To  appreciate  this  fact  calculate  that  the  restricted  quadratic 
form 


w  Mw,  we  Q, 

is  Identical  with 

( -A+By ,  y)T  M( -A+By ,  y) ,  y  e  R(M/A) 


which,  because  A+AA+  *  A+,  is  equivalent  to 


yT  (M/A)y ,  y  e  R(M/A) . 


As  before,  Definition  (2.2)  then  implies  (6.8) 


In  light  of  (6.4)  and  (6.8)  it  remains  to  analyze  the  term 


*  M  M 

In  (S  nQ  ;M)  appearing  in  (6.7).  To  this  end  consider  the 


quadratic  form 


z  (M/A)z,  z  e  SMnQM, 


which  is  equivalent  to 


(x  -  A+By ,  y)T  M(x-A+By ,  y) ,  y  *  -(M/A)+BTx  +  w, 

x  e  N( A) ,  w  e  M(M/A) . 


or 


2xtBw  -  xT  B(H/A)+Bt  x, 


x  e  N(A) ,  w  e  N(M/A) . 


From  this  last  expression  it  is  apparent  that 


In*(SMnQM;M)  -  In* (N(T) ;L) 


(6.9) 


where  the  symmetric  matrices  L  and  T  are  defined  by 


r-B(M/A)V  B 


B 


(6.10) 
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A'.**'  * 


and 


0  ‘ 
(M/A), 


(6.11) 


T 


A 

0 


The  determination  of  an  inertia  of  the  form  (6.9)  is  precisely  the 
question  of  restricted  quadratic  forms  that  will  be  treated  in  §7, 
and  either  of  the  two  methods  described  there  could  in  principle 
be  applied.  Moreover,  the  calculations  arising  are  not  as 
formidable  as  might  be  imagined,  for  Corollary  5.5  applies  to 
state  that 

In  L  =  (p,p,n+m-2p) , 


where  p  is  the  rank  of  B.  It  can  also  be  easily  verified  that 


L*= 


ro  b 

B+  -B(M/A)+BtJ 


Nevertheless,  that  course  is  not  pursued  here.  Instead,  a  direct 
assault  is  launched. 


Theorem  6.1.  Let  M  and  M/A  be  defined  as  in  (6.1)  and  (6.2). 
Then 

InM  =  In  A  +  In(M/A)  +  In*  (BT  N(A)oR(M/A)  ;-(M/A)  +  ) 

♦  (t,t,-t-dim(BT N(A)>)  (6.12) 

where 

t  =  dim{BT N(A)nN{M/A) )  (6.13) 

Proof .  The  matrix  L  defined  in  (6.10)  can  be  factored  as 

[B  0]  f-(M/A)+  l"j[BT  0 

L-[o  lit  I  oj[o  I 

Accordingly, 

In*  (N(T)  ;  -L)  «  In*(SN(T);L)  ♦  (0,0,  dim{N(i)r>N(T)  )  )  (6.14) 


*  B  LB 

■ 
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where  the  matrix  T  was  defined  in  (6.11).  But 


dim(N(i)nN(T) }  =  dim(H(BT )nN( A) )  ■  6(A)  -  dim(R(B)oN( A) ) 

*  6(A)  -  dim(BT  N(A) )  .  (6.15) 

Furthermore,  In  ( BN ( T ) ; L )  can  be  estimated  directly.  The  objec- 
tive  is  to  construct  a  L-orthogonal  basis  for  BN(T)  and  apply 

Lemma  2.2.  The  subspace  BN(T)  can  be  decomposed  as 

BN(T)  =  (BT  N(A)^R(M/A)  ,0}  ©  (BT  N(A)r>N(M/A)  ,  BT  N(A)"N(M/A)  ) 

©  (0,  N(M/A)p(BtN(A)  )x) 

*  S1  0  S2  ®  S3‘ 

It  is  apparent  that  the  subspaces  are  orthogonal.  It  can 

also  be  verified  that  they  are  L-orthogonal.  Consequently, 

In*  (BN(T)  ;L)  «=  In^S^L)  +  In*(S2;L)  +  In*(Sg;L).  (6.16) 

Now 

(0,x)T  L ( 0 , x)  «  0,  Vx  e  Rn, 

SO 

In*  ( S3  ;  L )  «  (0,0,  dim(N(M/A)^(BTN(A))x>) 

=  6 (M/A)  -  dim{BT N(A)nN(M/A) ) .  (6.17) 

Moreover,  as  was  described  in  the  proof  of  Lemma  5.1,  the  vectors 
(yi#  ±  y^)  are  eigenvectors  of  L  (with  corresponding  eigen¬ 
values  ±1)  whenever  y^  e  N(M/A).  Thus  the  subspace  S2  is 
mapped  into  itself  by  L,  and 

In*(S2;L)  -  ( t , t , 0 )  (6.18) 

where 

t  «  dim(BT N(A)0N(M/A) ) .  (6.19) 

The  best  that  can  be  said  about  the  remaining  term  is  that 
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(6.20) 


In  (S1;L) 


In  (B‘  N(A)nR(M/A) ;-(M/A)+) . 


Equations  (6.4),  (6.7),  (6.8),  (6.9)  and  (6.14)-(6.20)  pro¬ 
vide  (6.12)  and  (6.13)  as  required. 


Remarks. 

(a)  The  classic  result  In  this  area  concerns  the  case 
N( A)  *  (0),  in  which 

InM  «  InA  +  In(M/A) ,  (6.21) 

which  formula  is  due  to  Haynsworth  (1968).  Theorem  6.1 
demonstrates  that  the  condltons 

Bt  N(A)cR(M/A)  , 

and 

In*(BTN(A)  ;-(M/A)  +  )  *  (0,0,  dim(BTN(A)>) 

*  (0,0,  dim(R(B)r>N{A))) 

are  necessary  and  sufficient  for  (6.21)  to  hold. 

(b)  Carlson  et  al  (1974)  obtain  the  inequalities 

w(M)  *  n (A)  +  w (M/A) 

*>(M)  i  P(A)  +  V  (M/A) 

that  arise  from  (6.12)  when  the  last  two  terms  are 
discarded.  The  inequality 

6 (M)  s  6(A)  +  6 (M/A) 

is  then  immediate.  They  also  obtain  necessary  and 
sufficient  conditions  for  equality  in  these 
Inequalities,  namely: 


N(M/A)cN( (I-AA+)B) , *  &  (I“AA+)B(M/A)+BT  (I-AA*)  -  0. 

These  conditions  are  equivalent  to  the  necessary  and 
sufficient  conditions  for  equality  given  in  Remark  (a) 
above.  Actually,  the  interests  of  Carlson  et  al  are  not 
restricted  to  symmetric  partitioned  matrices,  and  the 
method  of  proof  they  use  is  correspondingly  more  general . 

(c)  Lemma  5.2  is  actually  a  special  case  of  Theorem  6.1. 

This  fact  is  not  immediately  obvious,  but  is  consequent 
upon  the  following  calculations.  When 
C  «  0,  (M/A)  ■  -BtA*B,  and 

dim(BTN(A)nR(-Bt  A+B))  s  <N(A)nR(A+B) )  *  0. 


Consequently,  BT  N( A)cN(M/A) ,  so  that 

In*(BTN(A)oR(M/A)  ;  -(M/A)+)  -  (0,0,0), 

and 

t  «  dim(BT  N( A)oN(M/A) }  *  dim  BTN(A) 

-  dlm(N( A)oR(B) ) .  (6.22) 


Then  (6.12)  becomes 

InM  -  InA  +  In(BT  (-A+)B)  +  (t,t,-2t).  (6.23) 

According  to  equation  (4.1),  In(BT  (-A+)B)  can  be  rewritten 
as  In* (Bt  ( -A+)B)  +  (0,0, n-p),  and  Corollary  4.3  can  be  applied  to 
eliminate  In* (BT  (-A+)B)  in  favour  of  a  term  In* (N(BT  ) ;-A) .  The 
relation  so  obtained  is 


*In  fact,  the  paper  of  Carlson  et  al  contains  a  typographical 
error  in  which  the  inclusion  sign  has  been  replaced  by  an 
equality. 


where 


But, 


and 


InM  ■  In  A  ♦  In(-A+)  -  In*(W(BT  )  :-A) 

♦  (t-d, t-d,  -  2 ( t-d)  ♦  n-p), 

d  -  di»{R{B)nAN(BT  )). 

InA  «•  In(-A+)  -  (w(A)  ♦  p(A),  «(A)  ♦  **(A),  26(A)) 

-  (a-6  (A) ,  a-6  (A) ,  26(A)) 

-6(A)  «  dia{N( A)nR(B) )  -  din(K(A)>  -  -  dia(H(A)oH(BT  ) ) . 


Consequently,  equation  (5.3)  la  recovered, 

Example  6.2.  Consider  the  aatrlx 

M  • 


0  0  0 
Oil 
11-1 
1-11 


Here 


and 


A  ■  [o  o]  '  8  ■  [l  l] 

*  [J  S]  •  «"/*>  * c  -  [-1  '5]  • 

BTN(A)  ■  H(M/A)  -  span  (1,1). 


Then  Theorem  6.1  reduces  to 

InM  -  (1,0,1)  ♦  (1,0,1)  ♦  (0,0,0)  ♦  (1,1, -2)  -  (3,1,0), 
and  It  can  be  verified  that  M  has  eigenvalues  ±/2,l,«. 


We  now  consider  a  problem  involving  partitioned  matrices  that 
was  originally  posed  by  Morse  (1972),  and  which  has  also  been  ana- 
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lyzed  In  terns  of  the  standard,  as  opposed  to  generalized,  Schur 
coapleaent  by  Cottle  (1974).  The  problea  Is  to  deteraine 

In* (N(T) ;M) 

where  the  (a<'n)’'(m)  aatrlx  M  Is  partitioned  as  In  (6.1)  and 

T  -  [A  B). 

That  Is  the  constraint  matrix  T  coincides  with  the  top  segment 
of  the  partitioned  matrix  M.  One  motivation  for  the  study  of 
such  problems  is  the  ainiaization  of  z  Mz  r  (x,y)TM(x,y)  with 
respect  to  the  variable  x  only. 

Corollary  2.7  Is  applied  with  S  ■  (P*,0).  Then 
SM  -  ((x.y):  Ax  ♦  By  -  0)  -  N(T) . 

Moreover, 

S^SM  -  (N( A) , 0) 

so  that  d*(S)  -  dla(S^SMoR(M) )  ■  dia(N( A)oR(B) } .  Consequently, 

It  aay  be  concluded  that 

InM  «  InA  ♦  In*(N(T);M)  +  (d ,  d ,  -d-6  ( A) )  (6.24) 

where 

d  -  dia(N(A)nR(B) )  »  dla(BTN(A)). 

Morse  (op.  clt.)  obtained  results  In  two  special  cases,  namely  M 
nonsingular,  or  A  nonsingular.  When  A  is  nonsingular,  (6.24) 
reduces  to 

InM  ■  InA  ♦  In*  (N(T)  ;M) ,  with  6(A)  -  0. 

As  was  pointed  out  by  Cottle  (op.  clt.),  this  case  is  particularly 
straightforward  because 


and  the  restriction  of  M  to  N(T)  coincides  with  the  quadratic 
fora  associated  with  the  natrlx  M/A.  That  is 

In* (N  (T) ;M)  ■  In(M/A) . 

The  second  case  considered  by  Morse  was  M  nonsingular. 

This  assumption  implies  that 

N( A)cR(B) , 

so 

d  -  6(A). 

Rearrangement  of  (6.24)  then  provides  the  equations 

MM)  -  KA)  ♦  t>*(N(T);M)  ♦  6(A), 

6(A)  -  6*  (N(T)  ;M)  , 

and 

«(M)  -  n  (A)  +  w*(N(T);M)  ♦  6(A), 

which  results  are  equivalent  to  the  relations  obtained  by  Morse 
and  rederlved  by  Cottle. 

In  point  of  fact,  the  main  thrust  of  Morse's  work  was  to 
obtain  an  expression  for  the  quantity  InM  -  In  A,  and  the 
introduction  of  the  restriction  of  M  to  N(T),  was  one  natural 
way  to  find  such  expressions.  However,  Theorem  6.1  provides  a 
different  formula  for  InM  -  In  A.  Of  course,  (6.24)  and  (6.12) 
are  closely  related.  They  coincide  exactly  in  the  case  that  A 
is  Invertible.  However,  in  general,  the  two  equations  provide 
distinct  Information. 

Other  results  are  obtained  if  the  quantity  InM  -  In  A  is 
eliminated  from  (6.12)  and  (6.24)  to  obtain  an  expression  for 
In  (M(T);M).  Me  shall  only  consider  the  case  C  ■  0,  in  which 


InM  -  In  A  can  be  eliminated  between  (6.23)  and  (6.24),  to 
obtain 


where 


But 


In*(N(T);M)  -  In(BT(-A+)B)  ♦  (0,0,  6(A)-t) 
t  -  dla(N( A)nR(B) ) . 

In(B’ (-A+)B)  «  In*(BT (-A*)B)  +  (0,0,  n-p) 
-  In* (R(B) ; -A+)  +  (0,0,  n-p) 


(6.25) 


and,  as  In  equation  (3.7), 

In* (R(B) ;-A+)  «  In*(A+H(BT  ) ; —A )  «  In* (N(BT ) A; -A) 

-  (0,0,  dim(N(A)nN(BT  )). 


Substitution  into  (6.25)  then  yields 


Lesaa  6.3.  Let  the  ( a+n ) * ( m+n )  symmetric  matrix  M  be 
partitioned  as 


where  B  is  ax  n  of  rank  p.  Let 

T  «  [A  B). 

Then 

In*(N(T);M)  -  In*(N(BT  )A;-A)  +  (0,0,  n-p)  (6.27) 

Lemma  6.3  was  first  obtained  by  Han  (1986,  Theorem  2.2)  in 
his  investigations  concerning  the  Wolfe  dual  that  arises  in 
nonlinear  programming. 

When  M  is  of  the  fora  (6.26)  we  have  obtained  three  expres¬ 
sions  for  InM,  namely  (5.3),  (6.23)  and  (6.24).  It  has  already 
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been  r«urk*d  that  aquations  (6.23)  and  (5.3)  are,  in  a  cartain 
Sanaa,  dual.  If  InM  is  eliminated  between  (5.3)  and  (6.24), 
rather  than  between  (6.23)  and  (6.24)  we  obtain,  after  soae 
calculation, 

In*(N(T);M)  -  In(-A)  -  In* (H(BT ) ;-A)  ♦  (-d*,-d#,d*  +  d  ♦  n-p) 
where 

d  -  din{N(BT  )oN(BT  )A)  -  dis(M(A)rN(BT  ))  ♦  d°(N(BT)) 

and 

d*  (N(Bt  ) )  *  d*  «  din  (A  N(BT)nR(B)). 

As  was  to  be  expected  this  last  result  can  be  obtained  directly 
from  (6.27)  by  use  of  Corollary  2.7  to  eliminate  the  tars 
In*  (N(Bt  ) A; -A) . 

As  a  final  comment  it  is  instructive  to  notice  that  the 
proofs  of  (6.12)  and  (6.24)  both  start  with  an  application  of 
Corollary  2.7  to  the  matrix  M.  However,  the  choice  of  subspace 
S  is  different.  When  S  ■  (R(A),0)  the  analysis  leads  to 
(6.12).  When  S  -  (**,0),  (6.24)  is  obtained. 

§7.  Tests  for  Positlvetv  of  Restricted  Quadratic  Forms 

The  preceding  development  has  exploited  the  equivalence  of 
symmetric  matrices  and  quadratic  forms,  but  the  main  focus  has 
been  on  inertia  theorems  for  matrices.  Emphasis  is  now  switched 
to  consideration  of  quadratic  forms.  In  particular,  we  shall 
consider  tests  that  guarantee  either 

xT  Ax  >  0  Vx  «  8\(0)  (7.1) 
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or 


X*  Ax  2  0  Vx  e  S .  (7.2) 

Hero,  as  before,  A  is  an  a**  symmetric  matrix  and  S  is  a 
subspace  of  R*.  It  is  further  assumed  that 

S  -  N(BT  )  (7.3) 

where  B  is  an  m»m  matrix  of  rank  p. 

One  motivation  for  the  study  of  conditions  such  as  (7.1)  and 
(7.2)  arises  in  constrained  nonlinear  programming.  Then  A  is  to 
be  interpreted  as  the  Hessian  of  the  Lagrangian,  and  B  is  the 
transpose  of  the  matrix  that  is  the  gradient  of  the  constraints. 
Details  can  be  found  in  many  references,  for  example,  Hestenes 
(1966) . 

In  the  notation  of  this  development,  condition  (7.1)  is 
equivalent  to 

In* ( S ; A)  -  (a-p,0,0) ,  (7.4) 

and  (7.2)  is  equivalent  to 

*MS;A)  -  0.  (7.5) 


It  is  apparent  that  the  preceding  theory  is  relevant,  but  that 
theory  merely  translates  conditions  such  as  (7.4)  or  (7.5)  into 
different  criteria.  The  immediate  question  that  arises  is  whether 
any  new  criterion  is  actually  easier  to  verify  than  the  original 
condition.  Concomitantly,  before  a  proper  discussion  of  the  pro¬ 
blem  can  be  undertaken,  some  assumptions  must  be  made  as  to  what 


(I)  If  a  concrete  expression  for  a  matrix  is  known,  its 
inertia  can  be  calculated. 

(II)  Calculations  of  inverses,  or  generalized  Inverses,  are 
to  be  avoided. 

According  to  these  maxims,  calculation  of  InBTAB,  which 

♦  * 

Involves  knowledge  of  In  B  AB  and  the  rank  of  B,  would  be 

♦  Y 

preferred  to  calculation  of  In  (N(B  ) ;A) .  This  is  because  the 

♦  y 

concrete  matrix  whose  inertia  is  associated  with  In  (N(B  );A)  is 
( I-BB+)A( I-BB+) ,  which  matrix  must  first  be  found. 

The  theory  described  in  this  work  offers  two  indirect  ap¬ 
proaches  to  the  verification  of  conditions  such  as  (7.1)  or  (7.2). 

The  first  indirect  approach  is  offered  by  Corollary  2.7,  or  one  of 

its  mutations  involving  A4.  For  example,  equations  (3.1),  (4.1)  | 

and  Corollary  (4.1)  provide  the  equation 

In* (N(Bt  ) ; A)  -  In  A  -  In  BT A+B  +  (d,d,  n-p-2d)  (7.6) 

where 

d  »  dlm{R( AB)nN(BT  )).  (7.7) 

^  T 

Thus,  conditions  (7.4)  and  (7.5)  on  In  (N(B  );A)  can  be  trans- 

#  -  ( 

lated  into  conditions  on  In  A,  In  (R(B);A  )  and  d.  According  to 

assumption  (i)  above,  it  is  easier  to  calculate  In  A  than  to 
calculate  In* (N(BT  ) ; A) .  However,  two  other  quantities  remain  to 
be  determined,  and  the  generalized  Inverse  A4  appears.  Never¬ 
theless,  this  first  indirect  approach  was  successfully  employed  in 
an  analogous  infinite-dimensional  problem  considered  by  Maddocks 
(1985).  The  crucial  feature  of  that  example  was  that  while  S  »  N(BT  ) 
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was  of  infinite  dimension,  SA  was  of  low  dimension.  Consequent- 

•  + 

ly,  calculation  of  In  (R(B);A  )  was  reduced  to  the  solution  of  a 
small  number  of  inhomogeneous  equations  of  the  form 

Axi  *  bi' 

where  is  a  basis  for  R(B).  The  analysis  of  the  problem  of 

Morse  that  was  presented  at  the  end  of  §6  provides  another  example 
of  this  first,  indirect  approach  to  the  determination  of 
In*  (N(Bt  ) ;  A)  . 

The  second  indirect  approach  is  provided  by  Lemma  5.2,  which 
may  be  rewritten 

In* (N(Bt  )  ;-A)  -  (m-d,  m-d,  n-p+2d)  -  InM.  (7.8) 

where 

d  «  dim(N(BT  )nN(BT  )A)  -  6*(N(BT  );-A)  (7.9) 

and  M  is  the  bordered  matrix  (cf.  Cottle,  1974,  Chabrillac  & 
Crouzeix,  1984) 


It  should  here  be  remarked  that 

|/(H(BT  )  ;-A)  «  n*(N(BT  )  ;A)  ,  n*(H(BT);-A)  «  v*  (N(BT  )  ;A)  , 

6*(N(BT  ) ; -A)  -  6*(N(Bt  )  ;A) , 

•  y 

so  that  (7.8)  does  provide  an  expression  for  In  (N(B  );A).  Actu¬ 
ally,  from  the  point  of  view  of  this  development  it  would  be  more 
natural  to  Introduce  a  bordered  matrix  with  upper  left  element 
-A,  but  that  approach  is  not  in  accord  with  previous  works. 


According  to  our  assumptions,  calculation  of  InM  Is 
straightforward,  albeit  that  M  Is  an  (m+n)* (m+n)  matrix  rather 
than  a  m*m  matrix.  The  degree  of  complexity  Involved  In 
calculation  of  d  is  not  immediately  apparent.  However,  the 
particular  structure  of  (7.8)  makes  this  Issue  mute  for  we  have 

Theorem  7.1.  Let  S  and  M  be  defined  as  in  (7.3)  and  (7.10). 
Then  M  has  at  least  p  positive,  p  negative  and  (n-p)  zero 
eigenvalues.  Moreover, 

(I)  property  (7.1)  holds  if  and  only  if  n(M)  *  m; 

and 

(II)  property  (7.2)  holds  if  and  only  if  v(M)  *  p. 

This  result  is  due  to  Chabrlllac  &  Crouzelx  (1984,  Theorem 
1).  It  also  follows  from  (2.2),  (7.4),  (7.5),  (7.8)  and  (7.9). 


It  should  be  remarked  that  while  the  two  lines  of  attack 
suggested  here  are  in  some  respects  similar,  they  also  possess 
intrinsic  differences.  The  first  method  is  based  upon  A-orthogo- 
nallty  in  Fn:  the  second  method  is  based  upon  M-orthogonallty  in 


The  theory  of  the  bordered  matrix  has  further  ramifications. 
For  example,  Han  (1986)  discusses  its  role  in  the  theory  of  Wolfe 
duality  in  nonlinear  programming.  Another  potentially  interesting 
observation  is  that  in  problems  where  the  subspace  S  of  (7.1)  or 
(7.2)  is  defined  in  terms  of  the  range  of  B,  equation  (6.23)  can 


be  used  as  the  basis  of  a  result  involving  a  bordered  matrix  with 


top  left  entry  A  . 


39 


Finally,  it  should  be  stressed  that  there  are  approaches  to 
conditions  (7.1)  and  (7.2)  other  than  the  inertia,  or  index, 
theory  described  here.  For  example,  there  are  tests  involving 
determinants,  and  tests  based  upon  penalization  or  augmentability 
methods.  Chabrillac  &  Crouzeix  (1984)  survey  several  of  these 
techniques . 

§  8 .  Summary  &  Discussion 

* 

The  first  part  of  this  work  introduces  an  inertia  In  (S;A) 
that  is  associated  with  a  pair  comprising  a  subspace  S  and  a 
symmetric  matrix  A.  The  main  result  is  Theorem  2.6  which  states 
relations  between  In*(S;A)  and  In*(SA;A),  where  SA  is  the 
subspace  that  is  the  A-orthogonal  complement  of  S.  This  theory 
essentially  comprises  a  specialization  to  finite-dimensions,  and 
an  extension  of,  a  general  Hilbert  space  theory  developed  by 
Hestenes  (1951)  and  Maddocks  (1985). 

The  definition  and  properties  of  In  (S;A)  are  nontrivial 
precisely  because  the  matrix  A  need  not  map  the  subspace  S  to 
itself.  Indeed  the  quantity 

d° ( S ; A )  «  dim{SnSAnR( A) ) , 

known  as  the  relative  nullity  of  A  on  S,  plays  a  central  role 
In  this  analysis.  Han  &  Fujiwara  (1985)  developed  a  comparable 
f inite-dimensional  theory  under  additional  hypotheses  guaranteeing 
that  the  relative  nullity  d*(S;A)  vanishes.  A  corollary  of  the 
analysis  given  here  is  that  d*(S;A)  vanishes  precisely  when 
can  be  written  as  the  sum  S+SA. 
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The  analysis  of  §§2  and  3  is  applied  in  §§4,5  and  6  to  obtain 


results  concerning  the  (standard)  inertias  of  symmetric  matrices 
with  particular  structures.  The  first  results  (Corollaries  4.1  - 
4.3)  concern  matrices  of  the  form  BTAB.  The  statement  of  Corol¬ 
lary  4.1  is  rather  complicated,  but  it  is  the  direct  generaliza¬ 
tion  to  matrices  B  not  of  full  rank,  of  Sylvester's  Law  of 
Inertia. 


Sections  5  and  6  treat  symmetric  partitioned  matrices 
the  form 


M  of 


(8.1) 


The  main  result  for  such  matrices  is  Theorem  6.1,  which  relates 
the  inertias  of  the  matrices  M,  A  and  M/A,  the  generalized 
Schur  complement  of  A  in  M.  The  statement  of  Theorem  6.1  is 
again  somewhat  complicated,  but  it  subsumes  inequalities  obtained 
by  Carlson  et  al  (1974),  and,  in  the  case  of  A  being  invertible, 
an  inertia  theorem  of  Haynsworth  (1968)  is  recovered. 


Several  of  the  results  on  partitioned  matrices  concern  the 
case  C  *  0 ,  i  .  e . 


M  * 


(8.2) 


For  such  matrices  three  different  expressions  for  the  inertia  of 
M  are  obtained,  namely  equations  (5.3),  (6.23)  and  (6.24).  Equa¬ 
tion  (5.3}  includes  a  widely  known  result,  here  stated  as  Corol- 

i 

lary  5.4,  in  which  B  has  full  rank.  Chabrillac  &  Crouzeix  j 

(1984)  obtained  a  result  closely  related  to  equation  (5.3),  but  ! 

t 

they  state  the  result  in  terms  of  properties  of  quadratic  forms  j 


(vide  infra).  Han  &  Fujiwara  (1985)  obtain  an  expression  equiva¬ 
lent  to  (5.3)  in  the  special  case  that  the  quantity  d,  defined 
in  (5.4),  vanishes. 

Equation  (5.3)  is  here  derived  directly,  whereas  equation 

(6.23)  is  obtained  as  a  special  case  of  Theorem  6.1.  However, 
equations  (5.3)  and  (6.23)  are  also  shown  to  be  dual  in  a  sense 
associated  with  the  A-orthogonallty  theory  developed  in  §§2  and  3. 

Equations  (5.3)  and  (6.23)  involve  various  restricted  iner¬ 
tias  of  A,  but  only  the  unrestricted  inertia  of  M.  However, 
equation  (6.24)  involves  the  inertia  of  H  restricted  to  the 
nullspace  of  the  m*(»+n)  matrix 

T  «  (A  B). 

When  the  triple  In M  is  eliminated  between  equations  (6.23)  and 

(6.24)  further  information  is  obtained,  which  is  here  formalized 
as  Lemma  6.3.  This  result  is  originally  due  to  Han  (1986),  who 
exploited  it  in  the  context  of  the  Wolfe  dual  that  arises  in 
nonlinear  programming. 

Equation  (6.24)  remains  valid  in  the  case  C  0  0.  It  then 
encompasses  results  of  Morse  (1971)  and  Cottle  (1974)  that  apply 
in  either  of  the  cases  of  A  or  M  being  nonsingular. 

The  properties  of  restricted  quadratic  forms  are  considered 
in  §7.  Two  approaches  to  tests  for  positivity  of  a  quadratic  form 
xT  Ax  on  a  subspace  S  *  N(B‘  )cRm  are  described.  The  first  test 
exploits  A-orthogonallty  in  Rn.  The  second  test,  originally  due 
to  Chabrillac  &  Crouzelx  (1984),  Involves  the  ( m+n ) * ( m+n )  bordered 


matrix  M  of  the  form  (8.2).  This  last  test  is  here  shown  to  be 

B+n 

a  consequence  of  M-orthogonality  in  R 

It  has  already  been  remarked  that  the  essence  of  the  theory 
developed  in  §§2  and  3  can  be  obtained  by  specialization  to 
finite-dimensions  of  a  more  general  Hilbert  space  theory.  As  is 
detailed  in  §§4  to  7,  this  descent  from  infinite-dimensions  allows 
both  an  extension  and  a  unification  of  known  finite-dimensional 
results.  Contrariwise,  few  of  the  proofs  utilized  here  are  intrin¬ 
sically  f inite-dimensional ,  and  some  of  the  known  matrix  results 
now  indicate  potentially  viable  routes  to  new  theorems  valid  in 
Hilbert  space.  For  example,  the  bordered  matrix  (8.2)  is  the 
direct  analogue  of  certain  systems  of  linear  differential  equa¬ 
tions  that  arise  in  the  study  of  the  second-variation  in  the 
isoperimetric  calculus  of  variations.  (see,  for  example, 

Maddocks,  1985, §3). 
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